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CLASSIFICATION  OF  INTERMITTENT  DEPENDENT  OBSERVATIONS 

by 
P.  A.  Jacobs  and  D.  P.  Gaver 

1.     THE  PROBLEM 

Consider  the  following  classification  problem.  Suppose  there  are  J  items 
(e.g.,  diseases)  each  of  which  has  a  characteristic  Signature  which  varies  in 
time;  the  Signature  of  Item  i  is 

\l{t)  =  Qi  +  Xi{t)       1  =  1,2,...,/     f  =  0,1,2 (1.1) 

For  the  moment  {Xj(f)}  is  an  unspecified  multivariate  (or  univariate) 
stochastic  process,  but  one  that  stays  near  0,  in  finite  time  and  has  some 
stationary  or  steady-state  behavior.  In  many  cases,  paths  of  X,-(f)  will  appear 
somewhat  "continuous,"  so  successive  X,(f)'s  are  not  well-modeled  as  iid 
random  variables.  One  could  think  of  Y,-(f)  as  physical  indices  characteristic 
of  a  particular  disease,  e.g.,  blood  pressure,  heart-beat  pattern,  cholesterol 
levels.  Examples  from  equipment  reliability  are  also  of  interest;  here  physical 
indices  might  be  vibration,  variations  in  heat  level,  oil  leakage,  and  even  fuel 
consumption  in  the  case  of  engines. 

In  many  circumstances  Y,-(r)  is  only  observable  occasionally,  at  times 
unrelated  to  the  value  of  Y,-(f)  but  driven  by  other  forces  such  as  the 
scheduling  of  a  routine  physical  exam  or  system  inspection.  Suppose  that  the 
Signature  and  the  identity  of  the  item  associated  with  the  Signature  are  both 
observed  at  time  t  =  0,  on  such  an  occasion.  Suppose  that,  later  on,  however, 
only  the  Signature  of  an  item  is  observed.  The  first  question  is:  What  is  the 
probability  that,  given  the  Signature  value  observed,  its  originating  item  is 
any  particular  one  of  the  J  candidates? 

In  Gaver  and  Jacobs  [1989],  the  processes  (Xj(t)}  are  assumed  to  be 
univariate  Gaussian  and  a  Bayesian  classification  procedure  is  studied.   In  this 


paper,  Section  2  assumes  (X{(t)}  are  multivariate  normal  autoregressive 
processes.  In  Section  3,  {Xj(t)}is  a  univariate  Cauchy  autoregressive  process 
whose  marginal  distribution  has  longer  tails  than  the  Gaussian.  A  Bayesian 
classification  procedure  for  the  Cauchy  data  is  studied.  In  Section  4,  we  study 
the  behavior  of  the  univariate  Cauchy  and  Gaussian  classification  procedures 
when  autoregressive  data  having  the  wrong  marginal  distribution  are 
presented  to  them.  The  results  suggest  that  the  Gaussian  classification 
procedure  is  biased  towards  classifying  a  Signature  produced  at  time  t  as  being 
associated  with  the  same  item  that  produced  the  Signature  at  time  0.  The 
Cauchy  classification  procedure  is  biased  towards  classifying  a  Signature 
produced  at  time  t  as  being  associated  with  a  different  item  than  the  one 
producing  the  Signature  at  time  0.  These  effects  are  strongest  for  small  times 
t.  The  largest  number  of  misclassifications  occur  for  small  times  t  when  the 
Gaussian  classification  procedure  is  presented  with  Cauchy  data  and  a 
different  item  is  associated  with  the  Signature  at  time  t  than  the  item 
associated  with  the  Signature  at  time  0;  in  this  situation  the  Gaussian 
procedure  is  relatively  less  sensitive  to  the  change  in  the  item  associated  with 
the  Signature.  Misclassifications  by  the  Cauchy  classification  procedure  are 
modest  in  comparison  to  this  extreme  case. 

In  summary,  it  is  important  to  realize  that  the  performance  of  a  Bayesian 
classification  procedure  can  be  influenced  by  its  underlying  distributional 
assumptions.  A  classification  procedure  based  on  Gaussian  distributional 
assumptions  can  be  reluctant  to  classify  a  new  observation  coming  from  a 
different  item  as  being  associated  with  a  new  item.  A  classification  procedure 
based  on  Cauchy  distributional  assumptions  can  be  reluctant  to  classify  a  new 
observation  which  comes  from  the  same  item  as  that  being  associated  with 
the  same  item.  Hence,  if  there  is  uncertainty  about  the  underlying 
distribution  of  the  data,  it  might  be  better  to  combine  results  of  several 
classification  procedures  based  on  different  distributional  assumptions. 


2.     THE  MULTIVARIATE  NORMAL  CASE 

2.1  The  Classification  Question 

Assume  for  illustration  that  the  Signature  of  Item  j  is  multivariate  AR(1): 

\j(t)  =  oj  +  xj(t)  (2.1) 

where  9,,  Y,(f),  and  X.-(f)  are  d-dimensional  column  vectors.    The  process 
|x,(f)|  is  a  vector  AR(1)  process 

Xy(0  =  AyXy(*-l)  +  E;.(0  (2.2) 

where  Ay  is  a  dxd  matrix  and  JE.(f)|is  a  sequence  of  d-dimensional  column 

vectors   which   are   independent   multivariate   normal   with   mean   0  and 
variance-covariance  matrix  A;.    The  variance-covariance  matrix  for  X.(£  +  l) 

is 


rTt 


T 


rj'(f  +  l)  =  EX(f  +  l)X'(/  +  l)  =Ayr;-(f)Aj  +Aj.  (2.3) 

We  will  assume  A,  and  A,  are  such  that  there  is  a  finite  unique  solution  to 
the  equation 

ry  =  A/r/AT  +  A/.  (2.4) 

Assume  X,(0)  has  a  normal  distribution  with  mean  0  and  variance- 
covariance  matrix  r,.  It  follows  that  <X;(fU  is  a  stationary  sequence  with 
mean  0  and  variance-covariance  matrix  r,. 

The  conditional  distribution  of  X.-(f)   given   X.-(O)  =  x    is    multivariate 

normal  with  mean  Ayx  and  variance-covariance  matrix 


Aj{t)=±AJAj(A]f.  (2.5) 


Thus,  r.  =  lim  A. -(f) 


t— >°° 


The  conditional   distribution  of  the  actually  observable  Y.-(f)    given 
Yj(0)=y(0)  is  multivariate  normal  with  mean  8j+Aj(y(0)-8j)  and  variance- 

covariance  matrix  A;(t). 

Operational  Scenario:  There  are,  potentially,  J  items.  Let  C(t)  be  the  identity 
of  the  item  whose  Signature  is  observed  at  time  t.  Put  pj(t)  =  P{C(t)=j}. 
Assume  that  it  is  known  that  the  Signature  observed  at  time  0  comes  from 
Item  i;  that  is,  C(0)  =  i  and  Y(0)  =  Y,(0)  =  y(0).  If  it  has  been  a  long  time  since  a 
Signature  from  item  i  has  been  observed,  it  is  reasonable  to  suppose  that 

P{C(0)  =  i,Y(0)  =  y(0)} 

=  p,(o)((2^|r,l)"0'5exp{4(y(0)-el)Trr1(y(°)-e.)}        (2-6) 


the  long-run  or  steady-state  distribution.    Further, 
P{C(f)  =  i,  Y(0  =  y(0|C(0)  =  i,Y(0)  =  y(0)} 

=  p;(t)[(2^|S,(t)|]"0'5exp{-I(y(0-mi(0)rE,(tr1(y(0-m.W)}     (2.7) 

where 

mI-(0  =  eI-  +  A{(y(0)-eI-)  (2.8) 

and 

S,W  =  Al(f).  (2.9) 


For  j  *  i,  we  will  assume  the  conditional  distribution  of  Y(t)  given  C(0)  =  i, 
Y(0)  =  y(0),  C(t)  =  j  is  multivariate  normal  with  mean  mj(t)  =  9j  and  variance- 
covariance  matrix  Tj  =  £j(t),  since  it  is  still  a  long  time  since  a  Signature  from 
Item  j  is  observed. 

It  now  follows  that 


P{Y(0  e  dy(t)\C(0)  =  i,Y(0)  =  y(0)} 
j  -0.5 

=  IP;(0[(2^|£7(0|]        expl-^yW-myWfzyW-^yW-myfO)}.  (2-10) 


Thus,  the  posterior  probability  of  the  identity  of  the  item  associated  with 
the  Signature  is 

P{C(t)  =  ;|C(0)  =  i,Y(0)  =  y(0),Y(t)  =  y(t)}  = 
^(Ol^W^'^xpj-ifylO-m^of^^r^ylO-m^O)} 

I^COl^COr^expj-^yW-m^Of^arVlO-m.W)! 


fc-1 


2.2  The  Probability  of  an  Incorrect  Classification 

In  this  section  we  assume  that  the  item  that  is  associated  with  the 
Signature  at  time  t  given  the  last  complete  observation  at  time  0  will  be 
estimated  to  be  that  one  which  maximizes  the  posterior  probability  (2.11). 

For  a  simple  illustration  we  will  suppose  that  there  are  only  J=2  possible 
items  with  known  parameters  8]  and  62- 

Given  Y(0)=y(0),  CXOM,  and  C(t)  =  1,  the  conditional  distribution  of  Y(t)  is 
multivariate  normal  with  mean 


m1(0  =  e1  +  A{(y(0)-G1)  (2.12) 

and  variance-covariance  matrix 

n 

fc=0 

Let  the  matrices  Hj(t)  and  Hj  be  such  that 


*l(0=X(Af)  AiAf.  (2.13) 


Hj  (t)Ui{t)T  =  Z,(0  (2-14) 

and 

HjHjT=rj.  (2.15) 

It  follows  that 

Yf^to^O  +  HtWU  (2.16) 

where  U  is  a  d-dimensional  column  vector  each  of  whose  components  are 
independent  standard  normal  random  variables;  the  notation  =  means  equal 
in  distribution.  Thus,  given  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  1, 

(YW-m^OrinW^MO-miW^U  (2.17) 

and 

(Y(0-m2(0)T2:2(f)-1(Y(0-m2(0) 

=(mi(0-m2(0  +  H1(0U)T2:2(0-1(m1(0-m2(0  +  H1(0U)  (2.18) 

where  m2(t)  =  02  and  Z2(t)  =  T2.   Thus,  the  probability  of  a  misclassification  is 


PJclassify  the  item  as  2|C(0 1  =  1,  Y(0)  =  y(0),C(f)  =  l} 

=  p|p2(f)|2;2(')r0'5exp{-^(m1(f)-m2(l)  +  H,(()U)7£2(fr1(m1(f)-m2(()+H1(f)U) 


>p,(()|5:,«)r0'5e'<p{4ljTu} 


_  pfpjWf N0[ 


=  PMj(^f)  2  >  ^{4UTU  +  i(a(0+  Hl(()U)Tr2'(a(0  +  Hl(»)U)}  (2.19) 


where 


a(0  =  e1+Ai(y(o)-e1)-e2. 

Example:  Assume  A,  =  A,  A,  =  A  for  i  =  1,  2,  and  pi(t)  =  p2(t);  then 
P{wrong  classification  |C(0)  =  1,  Y(0)  =  y{0),C(t)  =  l} 


(2.20) 


=  P 


.  Irl  y 


>  exp{-Vu  +  l(a(0  +  HtOufr^t)  +  HflU)} 


(2.21) 


where 


t-\ 

X 

k=0 


a(0=Ia*a(a*)   -H(0H(0T; 


r  =  lim  A(t)  =  HH     is  the  solution  to  the  equation 

t— >°° 


and 


r  =  ArAy  +A; 


a(0  =  e1  +  A,(y(o)-e1)-e2. 


Note  that  as  t— >«> 


P{ wrong  classification  |C(0)  =  1,  Y(0)  =  y(0),C(f)  =  l} 

-^p|i>expj--uTu+-(e1-e2  +  Hu)Tr"1(e1-e2  +  HU) 

\-l 


=  p  i>exPU(e1-e2)T(HHT)   (e1-e2)+(e1-e2)T(HT)   u|| 

=  p|-i(9i  -e2)T(HHT)"1(01  -02)  >  (6!  -e2)T(HT)"1u|.  (2.22) 

A  Simulation 

Table  1  gives  the  results  of  a  simulation  experiment  for  the  case  8i  =  (1,1  )T 


A  = 


and 


A  = 


In  this  case, 


r  = 


0.1     0.9 
0.4    0.5 


1     0 
0     1 


5.4     3.8 
3.8     4.5 


Figure  1  shows  contours  from  a  bivariate  normal  distribution  having  mean 
01  and  variance-covariance  matrix  I". 

In  each  replication  two  independent  vector  random  variables  are 
generated;  one  is  Y(0)  which  has  a  normal  distribution  with  mean  0]  and 
variance-covariance  matrix  I*;  the  other  is  U,  whose  components  are  two 
independent  standard  normal  random  variables.  For  each  time  t  =  1,  2,  ...,  40, 
Y(t)  is  calculated  as 


Y(t)  =  m(t)  +  H(t)U. 


(2.23) 


with  m(t)  =  9i  +  A1  (y(0)  -  0i);  Y(t)  has  the  same  distribution  as  a  Signature 
from  Item  1  when  the  Signature  at  time  0  is  also  from  Item  1.  There  are  1000 
replications.  Table  1  presents  the  fraction  of  replications  for  which  the 
incorrect  classification  is  made  of  Item  2  being  the  one  producing  the 
Signature  at  time  t;  that  is  those  replications  for  which 
1 

^]2>expg(y(0-e2)Tr-1(y(e)-e2)-i(y(0-e1)TAW-1(yW-e1^ 

(2.24) 

Note  that  the  fractions  are  not  independent  since  common  random  numbers 
are  used. 

The  contours  of  the  distribution  in  Figure  1  suggest  that  it  is  more  likely 
to  make  a  misclassification  if  02  =  (2,2)T  than  if  02  =  (-2,2)T;  the  fractions  in 
Table  A  support  this.  The  fractions  in  Table  A  also  suggest  that  the  probability 
of  misclassification  is  an  increasing  function  of  t.  This  observation  is 
supported  by  the  fact  that  the  variances  of  the  components  of  Y(t)  increase  as  t 
increases. 

TABLE  A.  FRACTION  OF  MISCLASSIFICATION 

0i  =  (1,DT 


Time: 

1 

2 

3 

4 

5 

10 

20 

30 

40 

02  =  (2,2F 

0.10 

0.13 

0.16 

0.20 

0.21 

0.30 

0.39 

0.41 

0.41 

02  =  (-2,2)T 

0.04 

0.06 

0.06 

0.07 

0.07 

0.09 

0.09 

0.09 

0.09 
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3.     CAUCHY  UNIVARIATE  MODEL 

In  this  section  we  consider  Bayesian  classification  for  a  time  series  model 
having  marginal  distributions  with  a  longer  tail  than  the  Gaussian 
distribution. 

We  assume  that 

Yj(t)  =  9j  +  Xi(t) 

with 

Xi(t)  =  piXi(t-l)  +  ei(t) 

where  I  pi  I  <1;  {ei(t)}  are  independent  sequences  of  independent  identically 
distributed  Cauchy  random  variables  with  location  parameter  0  and 
precisions  [(l  —  Jpjj)«/1  ;  and  Xj(0)  has  a  Cauchy  distribution  with  parameters 
0  and  a.  Under  these  assumptions  {Xj(t);  t  =  0,  1,  2,  ...}  is  a  stationary 

sequence  of  random  variables  with  marginal  Cauchy  distribution  having 

i     -°-5 
parameters  0  and  (Xj     . 

It  follows  that 
P{Yi[0)^dy{0)fYi{t)^dyi{t)} 


1 


a; 


n 


"«?  +(y(0)-  OifJ^afc-lPif  )  (a,(l- W*))2  +  (vMt  %  ~pj(y(0)-  %))' 


n-1 


(3.1) 


Let  C(t)  denote  the  identity  of  the  item  associated  with  the  Signature  at 
time  t  and  put  P{C(t)=i}=pi(t);  then 

P{Y(t)  e  dy(t),C{t)  =  i|C(0)  =  f,Y(0)  =  y(0)} 

=  w(0^«,{i-|Ai')  («i(i-|p/f))  +(y(O-0i-p-(y(o)-»,-))J 


10 


=  ^(0-afwkw2+W0-^W)2 

K  L 


(3.2) 


Thus, 


l 


P{Y(t)  e  dy(t)\C(0)  =  i,Y(0)  =  y(0)}  -  Ip/M^oUm*  +  (</(')- ™/(0) 

where    (Xi(t)  and  mj(t)  are  defined  in   (3.2)  and  it  is  natural  to  define 
mdt)  =  Oj  and  a.-(f)  -  «;  for;  *  i.     Hence,  given  item  i  is  associated  with  the 

Signature  at  time  0,  the  posterior  probability  that  item  j  is  associated  with  the 
Signature  observed  at  time  t  is 

p{C(t)  =  ;|C(0)  =  i,Y(0)  =  y(0),Y(0  =  y(t)}  (3.3) 

=  Pj(t)ccj(t)ccj(t)2+(y(t)-mj(tjf 


j^._.,.L.,.2. 2-1 


x  Ep*('K(w')  +(yW-***W) 

U=i 


-i 


3.2  The  Probability  of  Making  an  Incorrect  Classification 

In  this  section  we  assume  that  the  item  associated  with  the  Signature  at 
time  t  given  the  last  complete  observation  at  time  0  is  estimated  to  be  that  one 
which  maximizes  the  posterior  probability  (3.3).  For  simplicity  we  will 
suppose  there  are  J=2  possible  items  with  known  parameters  0i  and  02- 

First 


p{Y(t)  e  dy(t)\Y(0)  =  y(0),C(0)  =  l,C(f)  =  ;} 


=  ^aj(t)\(aj(t))\(y(t)-mj(t)f 


-1 


where 


(3.4) 
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a1(0  =  «i[l-|pif];a2(0  =  «2  (3-5) 

Mt)  =  0\  +p\{y(0)-^Y>™2(t)  =  02.  (3.6) 

Note  that  given  Y(0)  =  y(0),  C(0)  =  1  and  C(t)  =  1, 

y(*)2[0i  +p|(y(o)-ei)]+(i-|pif  )«iw  ^  m^j+aiWw 

where  W  is  a  Cauchy  random  variable  with  location  parameter  0  and 
precision  1,  Hence,  the  probability  of  making  the  incorrect  classification  of 
estimating  Item  2  as  being  associated  with  the  Signature  at  time  t  given  Item  1 
is  responsible  for  Signatures  at  time  0  and  time  t  and  Y(0)  =  y(0)  is 

P{Classify  as  Item  2|C(0)  =  1,  Y(0)  -  y(0),C(t)  =  l} 


=  P 


P2(0 

nit)' 


«2(0[«2(02  +(y(0-»z)2  J"1  >  «i(*)[ai(02 +(Y(0-m1(t))2p1|c(°)  =  i^(o)  -  y(o),c(t)  = 

of  +  (Wl(0  +  ai(t)W  -  02)2  J"1  >  [ai(tf  +(ai(OW)2]_1|C(0)  =  1, Y(0)  =  y(0),C(t)  =  1 


p2(f)  a2(0 


=  p 


pi(0  «i(0 

P2(0 


w(0 


«2(0«l(0  > 


a2  +  (Wl(()  +  ai(f)vv-02)2] 


1  +  W 


|C(0)  =  l,Y(0)  =  y(0)/C(0  =  l 


(3.7) 


Note  that  as  t—»0,  ai(t)— >0,  and  mi(t)— >y(0).   Hence,  the  conditional  probability 
of  a  wrong  classification  tends  to 


0> 


<*l  +  (y(o)-e2y 


1  +  WJ 


=  0. 


(3.8) 


As  t— »<*>,  oti(t)— >cii,  mi(t)— >0i  and  the  conditional  probability  of  a  wrong 
classification  tends  to 
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p\E^a2a^l  +  W2)>[al  +  (alW  +  dl-e2)2t.  (3.9) 


IpiH 

If  a2  =  ai  =  a  and  p2(°°)  =  pi(°°),  then  as  t— >«> 
Pfincorrect  classification! Y(0)  =  y(0),C(0)  =  l,C{t)  =  l} 


=  P 


a2(l  +  W2)>a: 


1  + 


VV  + 


(Ol-Ol)' 


=  p\wjL> 


w 


(9,-92) 


a 

n2l 


2\\ 


J) 


=  P\W> 


a 
9,  -92 


la 


which  increases  as  a  increases  and  decreases  as  1 Q\  -  62 1  increases. 


(3.10) 
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4.     ARE  BAYESIAN  CLASSIFICATION  PROCEDURES  ROBUST? 

In  this  section  the  robustness  of  the  univariate  Cauchy  and  Gaussian 
classification  procedures  against  misspecification  of  the  form  of  the  marginal 
distribution  will  be  studied. 

4.1  Gaussian  Data. 

In  this  subsection  we  assume  that  the  Signatures  of  the  Items  form 
Gaussian  time  series.   In  particular  we  assume  that 

Yi(t)  =  6i  +  Xi(t)  (4.1) 

with 

Xi(t+l)  =  piXi(t)  +  £i(t)  (4.2) 

where  {ej(t)}  are  independent  identically  distributed  normal  random  variables 

2  I        I 

with  mean  0  and  variance  o{  and  |  pi  I  <  1.  The  independent  random  variable 
X  i(0)  has  a  normal  distribution  with  mean  0  and  variance 
ai(°°)  =  °\ \  ly-~P\ [)■  Thus  {Xi(t),t>0}  is  a  stationary  sequence  of  normal 
random  variables  with  mean  0  and  variance  Gi(°°)2.  Let  C(t)  be  the  identity  of 
the  Item  associated  with  the  Signature  at  time  t. 

As  was  shown  in  Gaver  and  Jacobs  (1989),  the  conditional  distribution  of 
Y(t)  given  Y(0)=y(0),  C(0)=i,  C(t)=i  is  normal  with  mean 

m1(t)  =  dl+(y(0)-dl)pj  (4.3) 

and  standard  deviation 


cJi(t)  =  oiHJl-pft.  (4.4) 
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For  simplicity  we  will  assume  P{C(t)  =  i}  =  pj(t)  =  p(t)  and  there  are  2  Items 
with  parameters  81  and  02;  thus,  p(t)  =  2 

Suppose  the  Cauchy  procedure  is  used  to  estimate  the  identity  of  the  Item 
associated  with  the  Signature  at  time  t;  that  is,  the  Item  which  maximizes  the 
posterior  probability  (3.3)  is  the  estimate  of  the  Item  associated  with  the 
Signature.  Hence,  the  probability  of  an  incorrect  classification  is 

PJClassify  as  Item  2|C(0)  =  1,  Y(0)  =  y(0),C(f)  =  l} 


4^[«2(O2H(O  +  o-1(f)z-02)2]"1>[«1(O2  +  (o-1(Oz)2]~1 


=  P 


a2(t)a1(t^a2(t)2  +  (ml(t)  +  cl(t)Z-e2)2] 


!  ,  (<H(*)Z)' 

«i(0 


Til 


=  P 


a2aifl-|PlfI"2W2  +  (^lW  +  o-lWZ-02): 


-1-1 


> 


1  + 


°\ 


H^ 


p\ 


it 


«i(i-hf 


(4.5) 


where  Z  is  a  standard  normal  random  variable. 
Note  that  as  t->0 

PJClassify  as  Item  2|C(0)  =  1,Y(0)  =  y{0),C{t)  =  l} 


=  P 


«2«l(l-|Plf)[«2+(ml(f)  +  CTl(0Z-^2)2] 


1  + 


n2 


hfLz* 


i-hl' 
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=  p|a2a1(l-|p1f)[a|  +  (m1(0  +  cri(0Z-«2)2 


1  + 


°"iH 

L  "i 


HpiI  Z2 
HpiI' 


-ii 


=  p 


«1«2 


2 
(l-|pi|')[a2  +  (m1(0  +  a1(t)Z-a2)2p1>(l-|p1|')(l-|p1|')+  ^H    (l+N*) 


-1 


=  P 


ccl+(m^t)  +  G^t)Z-e2y 


<  a\a.2 


HPl      + 


«i 


n2 


i+|pi|')z 


->P^ 


ai«2  L 


«22+(y(o)-e2r 


< 


°"lH 
«1 


2Z' 


p) — Lra2+(y(0M2), 


21  <Z2^. 


(4.6) 


Thus,  the  conditional  probability  of  an  incorrect  classification  does  not  tend  to 
0  as  t— > 00  as  it  would  if  the  correct  model  were  used;  see  Gaver  and  Jacobs 
(1989)  (3.6). 

Note  that  as  t— >©° 

P{Classify  as  Item  2|C(0)  =  1,  Y(0)  =  y(0),C(f)  =  l} 


=  P 


«2al 


al+(e^  +  a^{oo)z-e2y 


-1 


> 


1  + 


V     «1 


-1 


(4.7) 


If  ai  =  0C2  =  1/  then  the  above  equals 


l  +  (o-1Hz  +  01-02)        >  l  +  fa^oojz) 
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=  p[(cj1Kz)2>(ct1hz+01-02)2} 


2    (     (Oi-Oi)^2 

P{Z2>  z  +  y 


o-iH 


which  is  the  same  as  if  the  correct  model  had  been  used  to  make  the  decision; 
see  (3.9)  of  Gaver  and  Jacobs  (1989). 

Now  we  consider  the  case  in  which  a  different  Item  is  associated  with  the 
Signature  at  time  t  than  the  one  associated  with  the  Signature  at  time  0.  Once 
again  for  simplicity  we  assume  0\  =  02  =  o,  pi  =  P2  =  p  with  |  p  |  <  1  and  for  the 
Cauchy  model  oci  =  0:2  =  a.  Let  a(°°)  =  a/  Vl-p2  and  o(t)  =  o~(°°)Vl-p2t.  We 
will  assume  Item  1  is  associated  with  the  Signature  at  time  0  and  Item  2  is 
associated  with  the  Signature  at  time  t. 

For  the  Gaussian  classification  procedure  of  Gaver  and  Jacobs  (1989),  the 
probability  of  an  incorrect  classification  is 
P{classify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 

=P{(l-p2t)"1/2exp{-2  [e2  +  a(oo  )Z-(9i+  (y(0Mi)p0]2/a(oo)2(i-p2t)} 

>exp{-2  (62+0(00  )Z-62)  /a(oo)  }} 
=P{-2  In  (i-p2t)_-[e2  +  0(00  )Z-(9i+  (y(0)-61)pt)]2/{o(oo)2(l-p2t)} 
>~2  (0(00  )Z)2/o(oo)2} 

=P{  (l-p2t)o(oo)   In  (l-p2t)  +[92  +  0(00  )Z-(0!+  (y(0)-G1)pt)] 

<(l-p2t)  (o(oo)Z)2}  (4.9) 
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where  Z  is  a  standard  normal  random  variable.   As  t-*0 

P{classify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2}  ->  P  |  (z  +    ^^  1  <  0  \  =  0;    (4.10) 

that  is,  if  the  Item  associated  with  the  Signature  at  time  t  is  different  than  the 
one  associated  with  the  Signature  at  time  0,  then  as  t— »0,  the  probability  of  an 
incorrect  classification  using  the  Gaussian  procedure  on  Gaussian  data  tends 
toO. 

As  t->°°,  the  probability  of  an  incorrect  classification, 

P{classify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 

-{(z^J<  +  p{z<%^). 

Suppose  now  the  Cauchy  classification  procedure  is  used  on  the  Gaussian 
data  with  Item  2  associated  with  the  Signature  at  time  t  and  Item  1  associated 
with  the  Signature  at  time  0.    The  probability  of  an  incorrect  classification 
P{classify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 

2"1  2"1 

=P{a(l-|p!t)[[a(l-|p|t)]2+[92+o(oo)Z-[01+(y(O)-e1)ptl]  ]    >a[a2+(82+o(oo)Z-e2)  1    1 

=P{(1- 1  p  I  *))  [a2  +  (a(oo)Z)2]>[(a(l- 1  p  I  *))2  +  [62  +  a(~  )Z  -  [6i  +  (y(0)-ei)p*]]2]}.  (4.12) 

As  t-^0 

Pfclassify  as  1  I  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 

->P{0>[62  +  o(~)Z  -  y(0)]  }  =  0;  (4.13) 
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that  is,  as  t— >0,  the  probability  of  a  correct  classification  for  the  Cauchy 
procedure  tends  to  1  for  the  case  in  which  the  Item  associated  with  the 
Signature  at  time  t  is  different  from  the  one  associated  with  the  Signature  at 
time  0,  even  though  the  data  are  Gaussian. 

As  t  — >  oo 
Pfclassify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 


Z2> 


02-01  fl     r       l02-0i. 


Hence,  as  t— >°°  the  probability  of  an  incorrect  identification  tends  to  the  same 
normal  tail  probability  for  both  the  Cauchy  and  Gaussian  classification 
procedures. 

Thus,  for  the  two  limiting  cases  t— >0  and  t— >°°,  both  the  Cauchy  and 
Gaussian  procedures  have  the  same  misclassification  probabilities  for  the 
scenario  in  which  the  Item  associated  with  the  Signature  at  time  t  is  different 
than  the  one  associated  with  the  Signature  at  time  0.  Note  that  these  are 
theoretical  limiting  results  with  all  parameters  known. 

To  investigate  further  the  behavior  of  the  two  classification  procedures 
on  Gaussian  data  when  Item  1  is  associated  with  the  Signature  at  time  0  and 
Item  2  is  associated  with  the  Signature  at  time  t,  let 

gt(y(0),Z)  =  (e2+a(~)Z-[01+(y(O)-e1)pt])2. 

The  conditional  probability  of  an  incorrect  classification  by  the  Gaussian 
procedure  is  from  (4.9) 
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PJClassify  as  Item  l|Y(0)  =  y(0),C(0)  =  l,C(t)  =  2} 

=P{(l-p2t)o(oo)2  ln(l-p2t)  <  (l_p2t)  (o(oo)Z)2  -gt(y(0),Z)} 
>P{(l-p2t)a(oo)2  ln(l-p2t)  <  (l- 1  p  1 1)  (a(oo)Z)2  -gt(y(0),Z)} 
>P{-a2(l- 1  p  1 1)  |  p  1 1  <  (l- 1  p  1 1)  (o(oo)Z)2  -gt(y(0),Z)} 

for  t  sufficiently  close  to  0.     From  (4.12)  it  follows  that  the  conditional 

probability  of  misclassification  for  the  Cauchy  procedure  is 

P{(1- 1  p  |  t)(o(oo)Z)2-gt(y(0),Z)  >  a2(l- 1  p  I  *)  [1—  I  p  I M]}. 

Hence  for  t  sufficiently  small,  the  incorrect  Cauchy  procedure  will  tend  to 
have  fewer  misclassifications  than  the  Gaussian  procedure  applied  to 
Gaussian  data  in  the  scenario  in  which  different  Items  are  producing  the 
Signatures  at  time  0  and  t. 

4.2  Cauchy  Data 

In  this  subsection  we  assume  the  Signatures  form  time  series  with 
Cauchy  marginal  distributions  as  in  Section  3.   In  particular,  we  assume  that 

Yi(t)  =  6i  +  Xj(t)  (4.15) 

with 

Xi(t+l)  =  piXi(t)  +  Ei(t)  (4.16) 

where  {ej(t)}  are  independent  identically  distributed  Cauchy  random  variables 
with  location  0  and  precision  [(1- 1  p;  |  )otj]-°-5  with  |  pj  |  <  1.   The  independent 

random  variable  Xj(0)  has  a  Cauchy  distribution  with  location  0  and  precision 

-1/2 

a,      .  Under  these  assumptions  (Xj(t)}  is  a  stationary  sequence  of  random 

-1/2 

variables  with  marginal  Cauchy  distribution  having  parameters  0  and  a; 

Further,  the  conditional  distribution  of  Y(t)  given  Y(0)  =  y(0),  C(0)  =  i,  C(t)  =  i 
is  Cauchy  with  location  parameter 
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mi(t)  =  ei  +  p;(y(o)-ei) 


(4.17) 


and  precision  parameter  (Xi(t)-1/2  with 

ai(t)  =  ai(l-|pi|t). 


(4.18) 


Let  C(t)  be  the  identity  of  the  Item  associated  with  the  Signature  at  time  t. 

For  simplicity  we  will  assume  there  are  two  items  with  parameters  61  and 
62.  Further  P{C(t)  =  i)  =  ps(t)  =  p(t). 

Suppose  the  Gaussian  procedure  of  Gaver  and  Jacobs  (1989)  is  used  to 
estimate  the  identity  of  the  item  associated  with  the  Signature  at  time  t;  that 
is,  the  item  which  maximizes  (2.12)  of  Gaver  and  Jacobs  (1989)  is  the  estimate 
of  the  Item  associated  with  the  Signature  at  time  t.  Hence,  the  probability  of 
an  incorrect  classification  is 


PJClassify  as  Item  2|C(0)  =  1,  Y(0)  =  y(0),C(0  =  l} 


=  P 


(72H 


exp 


(v(t)-e2) 

CT2H 


>  > 


*i(0 


exp 


y(t)-m(t) 


c(o)  =  1, 

Y(0)  =  y(0), 
C(t)  =  1 


(4.19) 


where 


m1(t)  =  61+p\(y(0)-el) 


(T1(0=<T1(oo)Jl-p1 


2t 


(4.20) 
(4.21) 


01H  =  o-i  /  $-fi);otH  =  °i  I  ^}-p\)  (4-22) 
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and   (Ji,  i   =   1,  2   are   the   assumed   standard   deviations   of  the   normal 
distributions.   We  will  assume  G\  =  02  =  o;  Pi  =  P2  =  P;  <*i  =  ot2.   Hence, 

PJClassify  as  Item  2|C(0)  =  1,  Y(0)  =  y(0),C(f)  =  l} 


O(oo) 


exp 


If  m^O+a^W-^ 


o-H 


^^)expr2 


«(t)w 

o(0 


(4.23) 


where  W  is  a  Cauchy  random  variable  with  location  parameter  0  and 
precision  1. 

P{Classify  as  Item  2|C(0)  =  1,7(0)  =  y(0),C(f)  =  l} 


=  P 


4^ 


>  exp 


a 


HpI' 


(i-p2t)oH: 


w 


2,  1 


(p*(y(Q)-gi)+(fli-g2)+«(Qw)' 


a(~)' 


=  p 


^- 


p      >  exp< 


a 


HpI']     2    i(pf(y(o)-0i)+(ei-02)+«(Owy 


(l  +  |pf)crH 


„  w2  + 

2  2 


a(~)' 


(4.24) 


f->0 


->P- 


0  >  exp 


l(y(O)-02r 


O(oo)^ 


=  0; 


thus,  the  probability  of  misclassification  tends  to  zero  as  t— >0  even  though  the 
incorrect  model  is  being  used;  the  correct  Cauchy  procedure  also  has  a 
probability  of  misclassification  tending  to  zero  as  t— »0.  As  t— >°° 

PJClassify  as  Item  2|C(0)  =  1,  Y(0)  =  y(0),C(f)  =  l} 


=  P< 


la2W2  il[81-e2  +  aW\ 

2  o-H2      2  o-(oo)2 
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=  p 


'^"7^2  f(^i-02)2+  2(d,-62)aW 


pjo  >  -(0!  -  d2f  +  (#1  +  02)«W 


=  PjlV<-II|e]-02| 


which  is  the  same  as  (3.10)  the  corresponding  probability  when  the  correct 
Cauchy  procedure  is  used. 

To  further  explore  the  behavior  as  t— >0,  let 

2 


gt(W,y(0))  =  [pKy(O)-ei)  +  (9i-e2)  +  a(l- 1  p  I  *)W] 


and 


B(t)  =  a2(l-|ph)W  . 
For  t  small  (4.24)  becomes 
PJClassify  as  Item  2|V(0)  =  y(0),C(0)  =  l,C(f)  =  l} 

=P{a(oo)2(i+  |  p  1 1)  in(l-p2t)  +  B(t)  >  (1+  |  p  |  *)  gt(W,y(0))} 
<P{-a2 1  p  |  t+B(t)  >  (1+ 1  p  I  OgtCW^O))} 
<P{-a2 1  p  I  »+B(t)  >  gt(W,y(0))} 
=P{a2(l- 1  p  | »)  +B(t)  >  a2+gt(W/y(0))} 

=P{(a(l- 1  p  1 4))2  +  (1- 1  p  1 0  B(t)  >  (1- 1  p  I l)  [a2+gt(W,y(0))]} 


=P{a[a2+gt(W,y(0))]_1  >  a(l- 1  p  | »)  [[a(l- 1  p  I  of  +  (a(l- 1  p  I  *)W)  ]    } 
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which  is  the  conditional  probability  of  misclassification  for  the  Cauchy 
procedure  on  Cauchy  data.  Hence  for  small  t,  the  incorrect  Gaussian 
procedure  will  tend  to  have  fewer  misclassifications  than  the  correct  Cauchy 
procedure  for  the  scenario  in  which  the  same  item  is  associated  with  the 
Signature  at  both  times. 

Now  we  consider  the  case  in  which  the  Item  associated  with   the 
Signature  at  time  t  is  different  than  the  one  associated  with  the  Signature  at 

time  0.    Once  again  for  simplicity  we  assume  oti  =  cc2  =  a,  pi  =  p2  =  p  with 

i     i  °-5 

I  p  I  <1  and  for  the  Gaussian  model  Gi  =  a2  =  °-   Let  a(°°)  =  a/(l-p2)      and 

0.5 

o(t)  =  a(°o)  (l-p2t)  .  We  will  assume  Item  1  is  associated  with  the  Signature 
at  time  0  and  Item  2  is  associated  with  the  Signature  at  time  t. 

For  the  Gaussian  classification  procedure  of  Gaver  and  Jacobs  (1989)  the 
probability  of  an  incorrect  classification  is 
Plclassify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 

=P{(l-p2t)^ 5  exp{-2  [G2  +  aW-  (Q^  +  p*(y(0)-ei))]2/a(~  Ml-p2')} 

1  22 

>exp{-2<e2+aW-e2)  /o(°°)  }} 

=P{~2  In  (l-p2t)  _  -  [e2  +  aW  -  (0]  +  pt(y(0)-81))]2/{o(oo  Ml-p*)} 

>-2(aW)2/o(~)2} 
=P{a(oo  )  (i-p2t )  in  (i_p2t )  +(e2  +  aW-Oi  +  pKyCOJ-Bi)))  (4.26) 

<(aW)  (l-p2t)J 
where  W  is  a  standard  Cauchy  random  variable. 

As  t->0,  the  probability  of  an  incorrect  classification, 
P{classify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 
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->P{(82+aW-y(G))   <0}  =  0. 

Hence  as  t— »0,  the  probability  of  an  incorrect  classification  tends  to  0  for  the 
Gaussian  procedure  on  Cauchy  data. 

As  t— >«>,  the  probability  of  an  incorrect  decision 

P{classify  as  1 1  Y(0)  =  y(G),  C(0)  =  1,  C(t)  =  2}->P{(62  +  aW-GO  <(aW)  } 

62-61    2  2  1 62-61  I 

=P{(W+^-L)   <W   }  =  P(W>     \a     }■ 

Suppose  now  the  Cauchy  classification  procedure  is  used  on  the  Cauchy 
data  with  Item  2  associated  with  the  Signature  at  time  t  and  Item  1  associated 
with  the  Signature  at  time  0.   The  probability  of  an  incorrect  classification  is 
P{classify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 

=P{cc(l- 1  p  I  l)[a2(l- 1  p  I l)2  +  [ez+aW^ei+pHyCGJ-ei))]2]"1 

>a[a2  +  (62  +aW-62)  ]"  } 
=P{(l-|ph[a2  +  (aW)  ] 

>[a2(l- 1  p  1 1)2  +  [e2+aW-(61+pKy(0)-61))]2]}.  (4.27) 


2 


As  t— >0,  the  probability  of  an  incorrect  classification 

P{classify  as  1 1  Y(0)  =  y(0),  C(0)  =  \,  C(t)  =  2}  ->  P{0>(62+aW  -  y(0))  }  =  0. 

Thus,   the   probability   of  an   incorrect   identification   using   the  Cauchy 
procedure  tends  to  0  as  t-»0  for  the  case  in  which  the  Item  associated  with  the 
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Signature  at  time  t  is  different  from  the  Item  associated  with  the  Signature  at 
time  0. 

As  t— >«>,  the  probability  of  an  incorrect  identification 

Pfclassify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 

2  (62-61)  2  1 82-61  I 

->P{W   >(W  +  Lj^)}  =  P{W>      \au} 

the  same  as  for  the  Gaussian  procedure. 

Hence,  for  the  two  limiting  cases  t— >0  and  t— >«>  both  the  Cauchy  and 
Gaussian  procedures  have  the  same  misclassification  probabilities  for  the  case 
in  which  the  Item  associated  with  the  Signature  at  time  t  is  different  than  the 
one  associated  with  the  Signature  at  time  0.  Note  these  are  theoretical 
limiting  results  with  all  parameters  known. 

To  further  explore  the  differences  between  the  Gaussian  and  Cauchy 
procedures  for  the  scenario  of  different  Items  associated  with  Signatures  and 
Cauchy  data,  let 

gt(W,y(0))  =  [<xW  +  62-8i-pt(y(0)-6i)]2.  (4.28) 

From  (4.26)  for  the  Gaussian  procedure,  the  probability  of  an  incorrect 

classification 

P{classify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 

=P{o(co)2(l-p2t)  in  (i_p2t)  <  (aW)2  (i_p2t)  _  gt(W,y(0))}. 
For  the  Cauchy  procedure,  the  probability  of  an  incorrect  classification 
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P{classify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 

=  P{a2(l- 1  p  1 4)[1- 1  p  I *-l]  <  (1- 1  p  1 0  (aW)  -  gt(W/y(0))} 

=  P{a2(l- 1  p  I l)  I  p  I l  <  (1- 1  p  I l)  (aW)2  -  gt(W/y(0))} 

<  P{o(oo)2(i-p2t)  in  ( i_p2t)  <  (l- 1  p  1 1)  (aW)2  -  gt(W/y(0))} 

<P{a(oo)2(l-p2t)  ln(l-p2t)  <  (i_p2t)  (aW)2-gt(W/y(0))} 

for  t  sufficient^  small.  Thus,  for  small  t  the  Gaussian  procedure  will  tend  to 
have  more  incorrect  classifications  than  the  Cauchv  procedure  for  the 
scenario  of  Cauchv  data  with  the  Item  associated  with  the  Signature  at  time  t 
being  different  than  the  one  associated  with  the  signature  at  time  0.  This 
effect  is  made  stronger  bv  the  fact  that  if  the  Gaussian  procedure  is  used  then 

2  2 

an  estimate  of  o(<»  )  will  be  needed.  An  estimate  of  o(°°  )  for  Cauchv  data 
will  tend  to  be  verv  large  since  the  Cauchv  distribution  does  not  have  a  finite 
variance.   This  effect  will  be  seen  in  the  simulations  of  the  next  subsection. 

4.3  Results  of  simulation  experiments 

This  subsection  reports  on  results  of  simulation  experiments  to  assess  the 
behavior  of  the  Gaussian  and  Cauchv  classification  procedures  when  thev  are 
confronted  with  data  from  the  other  distribution.  For  simplicity  we  assume 
there  are  two  Items.  In  the  first  subsection  the  autoregressive  process 
producing  the  data  is  Gaussian.  In  the  second  subsection  the  autoregressive 
process  producing  the  data  is  Cauchv.  In  both  subsections  classification 
procedures  using  both  the  Cauchv  and  Gaussian  distributional  assumptions 
are  assessed.  In  all  cases  pi  =  P2  =  0.5,  6]  =  1,  02  =  2.  The  simulations  use  the 
LLRANDOM  random  number  generator;  cf.  Lewis  and  Uribe  [1981]. 
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a.     Gaussian  Data 

The  simulation  in  this  subsection  uses  data  from  a  Gaussian 
autoregressive  process.  We  will  assume  the  means  of  the  two  Signatures,  81 
and  62,  are  known  and  p^  =  P2  =  p  is  also  known.  It  remains  to  assess  values 
for  the  (presumed  known)  scale  parameters  of  the  two  classification 
procedures.  In  particular  what  should  the  scale  parameter  a  =  ai  =  (X2  of  the 
Cauchy  procedure  be  when  it  is  applied  to  Gaussian  data?  To  obtain 
reasonable  values  for  01  =  02  =  0  for  the  Gaussian  classification  procedure  and 
a  =  ai  =  0C2  for  the  Cauchy  classification  procedure,  the  following  simulation 
experiment  was  performed.  The  experiment  has  100  replications.  In  each 
replication  100  independent,  standard  normals  are  generated.  For  each 
replication,  the  standard  deviation  of  the  data  is  computed  and  the  maximum 
likelihood  estimate  of  a  is  obtained  numerically  assuming  a  Cauchy  density 
function  of  the  form 

f(x)  =  "^2    ,     —  <X<~. 

The  medians  of  the  100  estimates  of  a  and  the  100  standard  deviations  are 
calculated.  The  values  obtained  are  <7M  =1.0  a^  =0.607.  Note  that  the 
estimates  of  a  are  using  the  incorrect  model  assumption  of  Cauchy  for  the 
Gaussian  data.  The  value  of  o^  is  used  in  the  Gaussian  procedure  to  classify 
observations.   The  value  of  aM  is  used  in  the  Cauchy  procedure. 

Tables  1  and  2  show  results  for  simulation  experiments  with  500 
replications.    In  each  replication  Y(0)  is  generated  from  a  normal  distribution 


with  mean  61  and  standard  deviation  0(00)  =  a/Vl-p2  with  o=l  and  p  =  0.5. 
For  Table  1  Y(t)  is  generated  from  a  normal  distribution  with  mean 
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m(t)  =  e1+Pt(Y(o)-e1) 


and  standard  deviation  a(t)  =  a(°°W 1— p*;  namely  the  Signature  observed  at 
time  t  is  from  Item  1.  For  Table  2  Y(t)  is  generated  from  a  normal  distribution 
with  mean  02  and  standard  deviation  o(°o);  namely  the  Signature  at  time  t  is 
from  Item  2. 

In  both  Tables  the  Gaussian  classification  procedure  assumes  &m  =  1.0  is 
the  correct  standard  deviation.  The  Cauchy  classification  procedure  assumes 
ajvl  =  0.607  is  the  correct  value  for  a. 

The  values  in  Table  1  suggest  that  when  the  same  Item  is  producing  the 
Signature  at  time  0  and  t,  then  the  Gaussian  procedure  produces  more  correct 
classifications  for  small  time  t.  However,  the  number  of  correct  classifications 
is  the  same  for  both  procedures  for  larger  t. 

The  values  of  Table  2  suggest  that  if  a  different  Item  is  producing  the 
Signature  at  time  t,  then  the  Cauchy  classification  procedure  has  more  correct 
classifications  at  time  t  for  small  t  even  though  the  data  are  Gaussian.  For 
larger  t,  both  procedures  have  the  same  number  of  correct  identifications. 

h.     Cauchy  Data 

In  this  subsection  the  data  arise  from  a  Cauchy  autoregressive  process. 
The  mean  Signatures  of  the  two  Items,  8]  and  62,  are  assumed  known  and  p  = 
Pi  =  P2  is  also  assumed  known.  It  remains  to  assess  values  for  the  scale 
parameters  of  the  Gaussian  and  Cauchy  classification  procedures.  In 
particular,  what  should  the  scale  parameter  o  =  01  =  02  of  the  Gaussian 
procedure  be  when  it  is  applied  to  Cauchy  data? 
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TABLE  1.  GAUSSIAN  DATA 

Item  1  Produces  the  Signature  at  time  0 
Item  1  Produces  the  Signature  at  time  t 


Time 

Fraction  Correct  Identifications 

Number  of  times 

t 

Gaussian  Proc. 

Cauchy  Proc. 

Gaussian  Correct 
/Cauchy 
Incorrect 

Gaussian 

Incorrect/Cauchy 

Correct 

1 

0.77 

0.65 

50 

0 

2 

0.68 

0.67 

5 

0 

5 

0.67 

0.67 

0 

0 

10 

0.70 

0.70 

0 

0 

TABLE  2.  GAUSSIAN  DATA 

Item  1  Produces  the  Signature  at  time  0 
Item  2  Produces  the  Signature  at  time  t 


Time 

Fraction  Correct  Identifications 

Number  of  times 

Gaussian  Proc. 

Cauchy  Proc. 

Gaussian  Correct 
/Cauchy 
Incorrect 

Gaussian 

Incorrect/Cauchy 

Correct 

1 

0.64 

0.71 

0 

38 

2 

0.69 

0.71 

0 

6 

5 

0.64 

0.64 

1 

0 

10 

0.68 

0.68 

0 

0 

To  obtain  reasonable  values  for  a  for  the  Gaussian  classification 
procedure  and  a  for  the  Cauchy  classification  procedure,  the  following 
simulation  experiment  was  performed.  The  experiment  has  100  replications. 
Each  replication  generates  100  standard  Cauchy  random  numbers.  For  each 
replication  the  standard  deviation  of  the  data  is  computed  and  the  maximum 
likelihood  estimate  of  a  is  obtained  numerically.    The  medians  of  the  100 


30 


estimates  of  a  and  the  130  standard  deviations  are  computed.    The  values 
obtained  are 

&M  =  13.23  and  oc^=  1.03. 

Note  the  high  value  of  the  standard  deviation. 

Tables  3  and  4  present  results  of  simulation  experiments  in  which  the 
data  are  from  a  Cauchy  autoregressive  process.  All  experiments  have  500 
independent  replications.  For  each  replication  Y(0)  is  generated  from  a 
Cauchy  distribution  with  location  parameter  6i,  and  scale  parameter  1;  that  is, 
Item  1  is  producing  the  Signature  at  time  0.  For  replications  reported  in  Table 
4,  Y(t)  is  generated  from  a  Cauchy  distribution  with  location  parameter  62  and 
scale  parameter  1;  that  is,  Item  2  is  producing  the  Signature  at  time  t.  For 
replications  reported  in  Table  3,  Y(t)  is  generated  from  a  Cauchy  distribution 
having  density  function 

«  x     l  a(t) 

nx;  "  k  a(t)2  +  (x-m(t))2 

with 

m(t)  =  91+pKy(0)-e1) 

and 

cc(t)  =  (l-|pf); 

that  is,  Item  1  is  also  producing  the  Signature  at  time  t. 

In  both  Tables  3  and  4,  the  Gaussian  classification  procedure  assumes  a 
standard  deviations  01=02  -^M-  The  Cauchy  classification  procedure 
assumes  the  a-parameters  ai  =  0:2  =  «m  . 

The  results  of  Table  3  indicate  that  for  small  times  t,  if  the  same  Item  is 
producing  the  Signature  at  time  0  and  time  t,  then  the  Gaussian  classification 
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procedure  has  more  correct  classifications  even  though  the  data  are  Cauchy. 
For  larger  times  t,  the  number  of  correct  classifications  is  the  same  for  both 
procedures.  On  the  other  hand,  the  results  of  Table  4  indicate  that  if  a 
different  item  is  producing  the  Signature  at  time  t,  then  the  Cauchy 
classification  procedure  has  many  more  correct  classifications  than  the 
Gaussian  procedure  for  small  times  t.  Once  again  the  number  of  correct 
identifications  is  the  same  for  both  procedures  as  t  becomes  larger. 

TABLE  3.  CAUCHY  DATA 

Item  1  Produces  the  Signature  at  time  0 
Item  1  Produces  the  Signature  at  time  t 


Time 

Fraction  Correct  Identifications 

Number  of  times 

Gaussian  Proc. 

Cauchy  Proc. 

Gaussian  Correct 
/Cauchy 
Incorrect 

Gaussian 

Incorrect/Cauchy 

Correct 

1 

0.98 

0.73 

124 

0 

5 

0.72 

0.69 

18 

0 

10 

0.66 

0.66 

0 

1 

TABLE  4.  CAUCHY  DATA 

Item  1  Produces  the  Signature  at  time  0 
Item  2  Produces  the  Signature  at  time  t 


Time 

Fraction  Correct  Identifications 

Number  of  times 

Normal  Proc. 

Cauchy  Proc. 

Normal  Correct 
/Cauchy 
Incorrect 

Normal 

Incorrect/Cauchy 

Correct 

1 

0.09 

0.74 

0 

316 

2 

0.14 

0.70 

0 

282 

5 

0.67 

0.61 

0 

29 

10 

0.64 

0.64 

0 

0 
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c.      Summary 

The  differences  in  performance  of  the  two  classification  procedures 
appear  for  small  time  t.  If  the  same  Item  is  producing  Signatures  at  both  0 
and  t,  then  the  Gaussian  classification  procedure  has  more  correct 
classifications  for  small  times  t  for  both  Gaussian  and  Cauchy  data.  If  a 
different  Item  is  producing  the  Signature  at  time  t,  then  the  Cauchy 
classification  procedure  has  more  correct  classifications  for  both  Gaussian  and 
Cauchy  data.  The  effect  is  strongest  if  the  data  are  from  a  Cauchy 
autoregressive  process;  in  this  case  the  Gaussian  procedure  does  very  poorly 

when  different  Items  are  producing  the  Signatures. 

In  summary,  it  is  important  to  realize  that  the  performance  of  a  Bayesian 
classification  procedure  can  be  influenced  by  its  underlying  distributional 
assumptions.  A  classification  procedure  based  on  Gaussian  distributional 
assumptions  can  be  reluctant  to  classify  a  new  observation  coming  from  a 
different  item  as  being  associated  with  a  new  item.  A  classification  procedure 
based  on  Cauchy  distributional  assumptions  can  be  reluctant  to  classify  a  new 
observation  which  comes  from  the  same  item  as  that  being  associated  with 
the  same  item.  Hence,  if  there  is  uncertainty  about  the  underlying 
distribution  of  the  data,  it  might  be  better  to  combine  results  of  several 
classification  procedures  based  on  different  distributional  assumptions. 
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54.     Prof.  William  Jewell 

Operations  Research  Department 
University  of  California,  Berkeley 
Berkeley,  CA    94720 


55.  Prof.  D.  C.  Siegmund 

Dept.  of  Statistics,  Sequoia  Hall 

Stanford  University 
Stanford,  CA    94305 

56.  Operations  Research  Center,  Rm  E40-164 

Massachusetts  Institute  of  Technology 

Attn:  R.  C.  Larson  and  J.  F.  Shapiro 
Cambridge,  MA    02139 

57.  Arthur  P.  Hurter,  Jr 

Professor  and  Chairman 

Dept.  of  Industrial  Engineering  and  Management  Sciences 
Northwestern  University 
Evanston,  IL    60201-9990 

58.  Institute  for  Defense  Analysis 

1800  North  Beauregard 

Alexandria,  VA    22311 

59.  Prof.  J.  W.  Tukey 

Statistics  Dept.,  Fine  Hall 

Princeton  University 
Princeton,  NJ    08540 
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60.     Dr.  Daniel  H.  Wagner 
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PaolLPA    19301 


61.  Dr.  Colin  Mallows 

AT&T  Bell  Telephone  Laboratories 
Mountain  Avenue 

Murray  Hill,  NJ    07974 

62.  Dr.  D.  Pregibon 

AT&T  Bell  Telephone  Laboratories 
Mountain  Avenue 

Murray  Hill,  NJ    07974 

63.  Dr.  Jon  Kettenring 

Bellcore 

435  South  Street 

Morris  Township,  NJ     07960 

64.  Prof.  David  L.  Wallace 

Statistics  Dept.,  University  of  Chicago 
5734  S.  University  Ave. 

Chicago,  IL   60637 

65.  Dr.  S.  R.  Dalai 

AT&T  Bell  Telephone  Laboratories 
Mountain  Avenue 

Murray  Hill,  NJ    07974 

66.  Dr.  M.  J.  Fischer 

Defense  Communications  Agency 
1860  Wiehle  Avenue 

Reston,  VA   22070 

67.  Dr.  Prabha  Kumar 

Defense  Communications  Agency 

1860  Wiehle  Avenue 
Reston,  VA    22070 


68.     Dr.  B.  Doshi 

AT&T  Bell  Laboratories 
HO  3M-335 
Holmdel,  NJ    07733 
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69.  Dr.  D.  M.  Lucantoni 

AT&T  Bell  Laboratories 
Holdmel,  NJ    07733 

70.  Dr.  V.  Ramaswami 

MRE  2Q-358 

Bell  Communications  Research,  Inc. 
435  South  Street 
Morristown,  NJ    07960 

71.  Prof.  G.  Shantikumar 

The  Management  Science  Group 
School  of  Business  Administration 
University  of  California 
Berkeley,  CA    94720 

72.  Dr.  D.  F.  Daley 

Statistic  Dept.  (I.A.S.) 
Australian  National  University 
Canberra,  A.C.T.    2606 
AUSTRALIA 

73.  Dr.  Guy  Fayolle 

I.N.R.I.A. 

Dom  de  Voluceau-Rocquencourt 
78150  Le  Chesnay  Cedex 
FRANCE 

74.  Professor  H.  G.  Daellenbach 

Department  of  Operations  Research 
University  of  Canterbury 
Christchurch,  NEW  ZEALAND 

75.  Koh  Peng  Kong 

OA  Branch,  DSO 

Ministry  of  Defense 
Blk  29  Middlesex  Road 
SINGAPORE    1024 


76.     Professor  Sir  David  Cox. 
Nuffield  College 
Oxford,  OXI  INF 
ENGLAND 
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77.     Dr.  A.  J.  Lawrence 

Dept.  of  Math  Stat.,  Univ.  of  Birmingham 
P.  O.  Box  363 
Birmingham  B15  2TT 
ENGLAND 


78.  Dr.  F.  P.  Kelly 

Statistics  Laboratory 
16  Mill  Lane 
Cambridge 
ENGLAND 

79.  Dr.  R.  J.  Gibbens 

Statistics  Laboratory 
16  Mill  Lane 
Cambridge 
ENGLAND 


80.  Dr.  John  Copas 

Dept.  of  Math  Stat.,  Univ.  of  Birmingham 

P.  O.  Box  363 
Birmingham  B15  2TT 
ENGLAND 

81.  Dr.  D.  Vere-Jones 

Dept.  of  Math,  Victoria  Univ.  of  Wellington 
P.  O.  Box  196 

Wellington 
NEW  ZEALAND 

82.  Prof.  Guy  Latouche 

University  Libre  Bruxelles 

C.  P.  212,  Blvd.  De  Triomphe 
B-1050  Bruxelles 
BELGIUM 

83.  Naval  Ocean  Systems  Center 
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